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Abstract 

There is a fundamental difficulty in generalizing weighted automata 
to the case of infinite words: in general the infinite sum-of-products 
from which the weight of a given word is derived will diverge. Many 
solutions to this problem have been proposed, including restricting 
the type of weights used (see Refs. [9], [10], and [11]) and employing 
a different valuation function that forces convergence (see Refs. [3], 
[6], [7], and [I]). In this paper we describe an alternative approach 
that, rather than seeking to avoid the inevitable divergences, instead 
embraces them as a source of useful information. Specifically, rather 
than taking coefficients from an arbitrary semiring S we instead take 
them from S . Doing this is useful because gives us information about 
how the weight of an infinite word does or does not diverge, and if 
it does diverge what form the divergence takes — e.g., polynomial, 
exponential, etc. This approach has proved to be incredibly useful in 
the field of quantum simulation (see Refs. [18] . |17j and [5]) because 
when studying infinite systems, information about how quantities of 
interest, such as energy or magnetization, diverge is exactly what we 
want. 
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In this paper we introduce a new kind of automaton which we 
call a diverging automaton that maps infinite words to sequences of 
weights from a semiring and which employs a Biichi-like boundary 
condition. We then develop a theory for diverging power series and 
prove a Kleene Theorem connecting rational diverging power series to 
diverging automata. Afterward we repeat this process by introduc- 
ing bidiverging automata which map biinfinite words to elements in 
S ZxN , developing a theory for bidiverging power series, and proving 
another Kleene Theorem. We conclude by describing how bidiverging 
automata are applied to simulate biinfinite quantum systems. 
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1 Introduction 

There is a fundamental difficulty in generalizing weighted automata to the 
case of infinite words: In general the infinite sum-of-products from which the 
weight of a given word is derived will diverge. Of course, one solution is to 
restrict oneself to the class of complete semirings for which an infinite sum-of- 
products is always guaranteed to converge, thereby excluding the possibility 
of divergences altogether (see Refs. j9], [10], and [11] for a description of 
the resulting theory developed in a very general setting); this is a perfectly 
satisfactory approach in settings where one's semirings meet the necessary 
requirements, but it nonetheless places very restrictive conditions on the 
semirings one can use. Another solution is to modify the definition of the 
resulting weight by allowing it to be an arbitrary function of the infinite 
sequence of weights rather than a straightforward sum of the products along 
all paths, and then to pick this function so that it is well-defined for all 
automata and words (for examples, see Refs. [I] and [7]). A specific example 
of the latter approach is the use of a deflation parameter that causes the 
weights of transitions to exponentially decrease in magnitude over the course 
of a run through the automata, thus forcing divergent sums to converge (see 
Refs. [6] , [1] , and [8] for development of the theory and [1] for an application 
of it to game theory). (Note that the use of a deflation parameter can be 



3 



viewed as a special case of the first approach; see §4 of Ref. [10] for the 
details.) 

In this paper we shall describe an alternative approach to handling the 
infinite sum-of-products resulting from the marriage of weights and infinite 
words: rather than seeking to avoid the inevitable divergences, we embrace 
them as a source of useful information. Specifically, we start with an arbitrary 
semiring S, but rather than assigning weights from S to the infinite words, we 
assign weights from S ; doing this is useful because each sequence of values 
from S gives us information about exactly how the weight of an infinite word 
does or does not diverge, and if it does diverge what form the divergence 
takes — e.g., polynomial, exponential, etc. 

This approach has proved to be incredibly useful in the field of quantum 
simulation (see Refs. [18], [17] and [5], replacing the term "infinite matrix 
product state" with "bidiverging automaton" and reducing the level of math- 
ematical formalism expected by the reader), by which is meant the use of 
classical computers to simulate quantum systems (as opposed to the hypo- 
thetical use of quantum computers for simulation). The reason for this is as 
follows: It is often very useful to study the theoretical properties of infinitely 
large physical systems because this gives us valuable information about the 
bulk properties of the system — that is, the properties of the system when 
effects due to the presence of the physical boundaries of the system are neg- 
ligible. In the infinite setting, physical quantities such as the total energy or 
the total magnetization of the system that increase as the number of particles 
increase will all be infinitely large, so merely learning that these quantities 
are divergent is not useful; instead, what we really want to know is what the 
functions of the quantities are with respect to the size of the system. It is 
for this reason that working with weights that are sequences of values from a 
semiring supplies exactly what is needed to extract the desired information 
from a simulation. 

The remainder of this paper shall be structured as follows. In £j2]we shall 
formally introduce what we shall call a diverging automaton, which is a kind 
of automaton that maps infinite words to infinite sequences of weights from a 
semiring. In §3] we shall define a kind of power series over infinite words that 
we shall call a diverging power series, as well as rational operators on this se- 
ries, and then we shall prove a Kleene Theorem connecting rational diverging 
power series and rational automata. In §H we shall likewise introduce a kind 
of automaton that maps biinfinite words to biinfinite sequences of infinite 
sequences of weights, which we shall call a bidiverging automaton, and in £JS] 
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we shall likewise define a kind of power series over biinfinite words, which 
we shall call bidiverging power series, as well as a set of rational operations 
on these power series, and then we shall prove a Kleene Theorem connecting 
bidiverging automata to bidiverging power series. In §[6] we shall discuss how 
bidiverging automata are applied in practice for the purpose of simulating 
quantum systems. Finally, in §7] we shall present conclusions. 

2 Diverging Automata 
2.1 Preliminary Formalism 

In this section we shall define a form of automaton that has the behavior of 
mapping infinite words to infinite sequences of values from a semiring. First, 
however, we need to add some structure to infinite sequences in order for us 
to be able to use them as coefficients on infinite words. 

Let S be an arbitrary semiring. It will often prove convenient to take 
advantage of the fact that S = N — > S in order to be able to use function 
notation for defining sequences, so if v G >S N then v (i) refers to the (zero- 
based) i th element of v, and i y-¥ v(i) defines a sequence that is equal to 
v. We now define three operations on S . addition, left-multiplication by 
members of S, and right-multiplication by members of S. For all sequences 
a,b G S N , we define a + b := (i i-> a(i) + b(i)), i.e. addition acts pointwise 
on the elements of the sequences. Given s 6 S and i; 6 S N , we define left 
multiplication by letting s • v — sv := (i i-> sv (i)), i.e. left-multiplication 
by elements of S multiplies all elements in the sequence by that factor, and 
likewise we define right-multiplication by letting v ■ s = vs := (i t— ?■ v(i)s). 
Finally, we let the zero (additive identity) of be the zero sequence, j ^ 0. 
It is not difficult to see that all of the semimodule laws are obeyed by these 
operations and the choice of zero, and so we conclude that S N is both a left 
and a right S'-semimodule (that is, an S-semibimodule). 

Let A be some alphabet. Words from this alphabet include elements of 
A*, which we call finite words; elements from A u = A N , which we call infinite 
words; and elements from A^ = A z , which we call biinfinite words. In all 
cases the character of a word at zero-based position i is denoted by w^], so 
that for example wm refers to the third character of a finite or infinite word, 
and W[-s] refers to the character at position —3 of a biinfinite word. 

The notation W[ s:e ] denotes the substring of w starting at the s th character 
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and going up to but not including the e character. If e is finite then 
W[s-.e] = W[ 8 ]W[ a+1 ] . . . W[ e -i] and if e is oo then w [s]e] = w [s] w [s+1] in both 
cases (w[ s:e ]) [{] := w [s+i] . 

The length of a word w is given by \w\ which will be finite for finite words 
and oo for infinite words; we will denote the unique string of length by e. 
Note that |w[ s:e ]| = e — s, which may be infinite. 

Words can be concatenated to form other words, so that for example 
given words a £ A* and b G A u the word ab is the sequence a followed by 
the sequence b, i.e. 



(ab)(i) = 




i < \a 
a\) i > \a 



Note that in the sequel we will continue to use S to denote an arbitrary 
semiring and A an arbitrary alphabet unless stated otherwise. 



2.2 Diverging Automata Defined 

We now define the structure of the automaton as follows: Given an alphabet 
A and a semiring S, we define a diverging automaton A over A and S to be 
a tuple, (Q A , I A ,F A ,M A ), where 

• Q A is the set of states in the automata; 

• I A G S® is the initial distribution of states, which for convenience we 
shall assume to be non-zero unless stated otherwise^] 

• F A G is the final distribution of states, which for convenience we 
shall also consider to be non-zero unless stated otherwise; and 

• M A G S AxQxQ = (S((A))) QxQ is the tensor providing the weighted 
transitions between states for each input symbol — or, equivalently, a 
matrix of formal power series over the input alphabet with coefficients 
in S. 

It will be convenient to establish some conventions at this point. First, 
when defining an automaton A we shall do so by defining Q A , I A , etc. with 

*We do this because it prevents us from having to deal with a number of trivial and 
uninteresting cases in many proofs. 
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the understanding that A := (Q A , I A , F A , M A ) (where the superscript on the 
elements of the tuple denote the automaton with which they are associated) 
so we do not need to relate the elements of the tuple to A explicitly. Second, 
when we refer to the states in I A or the states in F A we shall be implicitly 
referring to the states with non-zero weight, which we shall call respectively 
the initial states and the final states, or the initial state and the final state 
if there is only one such state. Third, the notation Q A U {q} will always 
mean adding a fresh state to Q A unless otherwise stated (that is, if there 
was already a state named q in Q A then it will be implicitly be relabeled to 
q' and the rest of the automaton adjusted accordingly). Finally, if R C Q A , 
then I A will refer to the block of I A for the states in R, and likewise for F A 
and (both subscripts of) M A . 

Note that at this point there is nothing that distinguishes this automaton 
from a weighted automaton over finite words which we will henceforth call a 
converging^ automaton. The isomorphism between diverging and converging 
automata shall prove useful throughout this paper, and so we shall denote by 
A the converging counterpart of A, by which we mean the automaton that 
has the same tuple as A. 

M A induces the structure of a directed graph where each (directed) edge 
is labeled by a value in S{{A)), i.e. by a sum over allowed input symbols with 
a coefficient from S on each; by convention, when we refer to the directed 
edges in the graph without a qualifier we will only be referring to the edges 
with a non-zero label, since these are usually the only ones we care about. 

Because of the graph structure of an automaton, we can (and shall) mean- 
ingfully talk about the path or paths taken by a word, which we define as 
follows: a path for a word w is a sequence {qi}o<i<\ w \ (which may be infinite) 

such that for every < i < \w\ we have that M q j££l ^ 0. A finite path is 
said to be viable if it starts on an initial state and ends on a final state. 

We now need to define the behavior of an automaton A, denoted by 
||^4|| G S n ((A wS j), which describes the formal power series recognized by A 
as a function of the elements in its tuple. For convenience, we observe that 
S N ((A U >)) = (A u x N -)■ S), and so we shall denote by A(w, ■) the value of 
the (infinite sequence) coefficient on the infinite word w and by A(w, i) the 

2 The word 'converging' was chosen both to provide a nice contrast to 'diverging' and 
also because the word 'finite' has essentially already been taken in the context of automata 
to refer to the number of states — as in 'finite state automata'. 
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value at the (zero-based) i th position of A(w, •). Equivalently we have that, 

||^4|| = (n i — y A(w,n)) ■ w. 

To define the behavior, we recall that we stated previously that an im- 
portant motivation is to model the rate at which the infinite sum-of-products 
in an automaton diverge for infinite words. Given this, an obvious definition 
for the behavior of A is to let A(w, n) be equal to the weight that W[o :n ] has 
in A (which recall is the converging automaton with the same tuple as A), 
namely 

A(w, n) := I A ■ ( JJ M^A ■ F A . 

(Note: The definition above will eventually be modified for reasons that 
will be shown later, but we start with it in this form for pedagogical reasons; 
the final form will be presented in §2.51 ) 



2.3 Examples 

To see the consequences of this definition, we shall now consider some exam- 
ples. For the automata defined in these examples, we shall use the standard 
form of diagrammatic notation: 

• states are denoted by circles; 

• the set of non-zero (weighted) transitions are denoted by arrows be- 
tween states which have labels of the form l x : y 1 where x G A is the 
input symbol for the transition and y e S is the weight of the transition; 

• the set of initial states are denoted by arrows with an end state but no 
starting state and labeled with the initial weight; and 

• the set of final states are denoted by arrows with an starting state but 
no ending state and labeled with the final weight. 

For the sake of being explicit, values will also be given for each of the quanti- 
ties in the 4-tuple for each automaton; the transition matrix M will be given 
as a matrix of pairs of semiring coefficients and input symbols. 
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a:T a:T 



A := {a, (3}, S 



T 




) 



) 



Figure 1: Automaton Example ^1 



We first consider the automaton A\ in Figure [TJ It is not difficult to see 
that for all words in the language a*(3a u we have that 



because for a prefix of ma symbols it takes at least m + 1 steps to make it 
to the state q2, which is the only final state, and once at q2 the automaton 
loops forever (for this word). 

We next consider the automaton A2 in Figure [21 where we observe that 



To see why, first note that all viable paths must start at qi and end at #3 
because the former is the only state with outgoing transitions for a and the 
latter is the only final state. Based on this, we immediately conclude that 
the value for (a/?)^ must be zero for even n because only an odd length path 
can make it from q\ to q-$. Next we note that the only viable paths with 
odd length n are those which consist of (n — l)/2 round trips between qi 
and either q 2 or g 3 , followed at the end by a step from qi to ^3 of weight 1. 
There are 2 (n_1 * ) / 2 independent paths of this form (as each round-trip in each 
sequence independently chooses q2 or q 3 as a destination), and furthermore 
each path has total weight 2 <n ~ 1 - ) / 2 (a factor of 2 for each round trip), so 
because the final weight is 2, the value for (a/3) w for odd n is therefore 
2(n-i)/2 . 2(n-i)/2 .2 = 2™. By applying similar reasoning, it is not hard to see 



F n < m + 1 
T n > m + 1 




n even. 
n odd. 
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A:= {a,(3},S:=N, 



( fO\ ( (2, a) (l,a)\\ 

^ 2 := {91,92,93}, (1 2 0), , (l,/3) 

V V 2 / \(2,/3) 

Figure 2: Automaton Example #2 
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a:l 




A:= {«,/?}, 5:=N, 



/0\ /(l,a) (l,a) 
{?i,«2,53},(l 0), 1 , (l,a) 

Figure 3: Automaton Example #3 
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that 







n = 







n odd 



2 



n even. 



(If this result is not obvious, observe that = /3(a/3) aJ and that q 2 has 

initial weight 2.) 

Finally, we consider the automaton ^4.3 in Figure [3j for which we observe 
that ^4.3 n) = n due to the fact that for every n there are are n paths of 
length n starting at q% and ending at q 2 , and each path has weight 1. It is 
also straightforward to see that 



2.4 A Problem 

In the above examples we provided some illustrations for how the definition of 
the behavior works out in practice, but there is one very important property 
of this definition that has not yet been touched upon: since the computation 
of A(w, n) only depends on the first n characters of w, all words with the 
same prefix of length n have the same value at n in their sequences — that 
is, given words w and v such that u>[o:n] = ^[o:n]j it immediately follows 
that A(w,n) = A(v,n). This property is unfortunate because it means 
that there are many strings that intuitively should be entirely rejected by a 
given automaton — that is, mapped to the zero element of S^, which is the 
sequence with all zero entries — that instead are accepted by the automaton 
— that is, mapped to a sequence with non-zero entries. 

To see examples of this, first consider again the automaton in Figure [TJ 
This automaton is designed to filter out strings with more than a single (3, 
and yet AilPfia^, n) = 5 n i 7^ 0, where 5ij is the Kronecker delta, 




and 




< n < m 




otherwise. 
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Likewise, if we consider again the automaton in Figure H] we see that although 
strings with more than a single a ought be rejected entirely we actually have 
that A.2(aP,n) = 25 n \ ^ 0. Finally if we consider again the automaton in 
Figure [3] we see that although no (3 should follow an a, we actually have that 
A 3 (af3",n) = 5 nl . 

Put another way, given an arbitrary language L we would (naively) expect 
that if we took a Biichi automaton that recognized L and converted it to a 
diverging automaton by labeling the existing transitions with weight 1 (in the 
semiring B) and the non-existing transitions with weight 0, then we would 
end up with an automaton that only accepted words in L, but from the 
preceding discussion we know that this will not be in true in general. 

2.5 A Fix 

The problems described above ultimately come from the fact that our defi- 
nition for the behavior has the undesirable property that later parts of the 
word cannot affect early parts of the sequence, so by the time a word has 
hit a dead end that would have caused it to be rejected were it finite, it 
has already generated a non-zero subsequence and hence cannot be entirely 
rejected under the current definition of the behavior. So, in a matter of 
speaking, if we want to make the behavior of our automata more sensible, 
we need a way for this future information to travel backwards in time to the 
beginning of the word. 

Fortunately, we can do exactly this in a way that borrows a page from 
the Biichi playbook (see Refs. j3] and j2]). In a Biichi automaton at least 
one of the final states must be visited infinitely often for an infinite word to 
be accepted. This condition provides exactly what we need, because a word 
will hit a dead end if and only if it fails to visit a final state infinitely often, 
though because our automata have weights we need to be a bit more careful 
about how we define 'visit infinitely often' because it is possible for there to 
be multiple paths that are individually non-zero but which cancel when they 
meet at particular states. Thus, we say that a word visits a state q infinitely 
often if there is a sequence of monotonically increasing indices {nj} ig N such 
that for every i the sum of all paths of length rii ending on state q is non-zero 
- or, conversely, a word visits a state q only finitely often if there is an N 
such that the sum of all all paths with length greater than N eN ending on 
state q is zero. 

Thus, based on the above considerations and inspired by the Biichi au- 



13 



tomaton acceptance condition, we modify our definition of the behavior of a 
diverging automaton as follows: 

A(w, n):=I A - (jj M A ' W ^ ■ F A ' [w K 

where the changed factor (which has been underlined solely to highlight it), 
pA,[w]^ denotes F A with the weights of all states set to zero except for those 
visited infinitely often by w, which we shall call the states activated by the 
word. Similar to the case of finite words, the right-hand side can equiva- 
lently be interpreted as the sum of all the viable paths taken by u>[o :n ] where 
each path bears a weight equal to the product of transitions along the path 
(U^=o M A ' W M), the weight of the initial state of the path (I A ), and the weight 
of the final state of the path (F A '^). This follows from the fact that the 
above sum can alternatively be expressed as 

A( w „\ ._ V t^m A ' W[0] M A,W[1] ■ ■ ■ M A ' W[n - 1] F A[w] 

S\.\W,Ib).— / j l qo lVI qoqi lVI qiq2 lVlq n _ iqn r Qn , 

qo,—,Qn£Q 

where each non-zero term in the sum has a separate assignment of {qi}o<i< n 
C Q that corresponds to a viable path with a sequence of states equal to 
{qi}o<i<n C Q, and vice versa. 

With this new definition for the behavior, all of the problems that we 
listed earlier disappear because in each case no states become activated and 
so all F A '^ = 0. However, in cases where all states become activated, 
converging and diverging automata behave similar to how our old definition 
played out, as shown in the following Lemma. 

Lemma 1 (Conditions under which diverging matches converging). Given a 
diverging automaton A and its converging counterpart A, if for some infinite 
word w we have that F A ^ = F A , then for all n GN we have that A(w, n) = 
A(w [0:n] ). 

Proof. Follows immediately from the definitions of the behaviors of diverging 
and converging automata and the fact that the tuples of A and A are equal. 

□ 

2.6 Elementary Operations 

Before leaving the subject of diverging automata, we take a moment to define 
a couple of elementary operations. First, given an automaton A (converging 
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or diverging) over some semiring S and scalar values l,r e S, we define 
l-A-r = lAr such that Q Ur := Q- 4 , l\ Ar := I ■ I A , F lAr := F A ■ r, and 
M Wr := M- 4 — that is we left-multiply the initial state vector by I and we 
right-multiply the final state vector by r and we leave everything else as is. 
Second, given another automaton B, we have that A + B is given by 

Q A+B : = 
jA+B . = 



B 




F A+. 



M A+B 



that is, essentially the two automata are simply merged into a single automa- 
ton but kept separate from each other. Finally, we define the zero (additive 
identity) automaton by letting Q° := 0, 7° ;= := 0, and M°. = 0. These 
operations make automata into S'-semibimodules, and it turns out that the 
behavior operator ||-|| is a semibimodule homomorphism, as the following 
Lemma demonstrates. 

Lemma 2 (Behavior is homomorphism (for converging and diverging au- 
tomata)). Given automata A and B over some semiring S and scalar values 
a, (3,"f,5 G S we have that \\aA7 + f3B5\\ = a \\A\\ 7 + /3 ||#|| 5 

Proof. Follows straightforwardly from the definitions, so the proof has been 
left as an exercise for the reader. □ 



3 Diverging Power Series 

3.1 Diverging Power Series Defined 

Having introduced a new form of automaton in the previous section, we shall 
now introduce a corresponding new kind of power series, and then prove a 
Kleene Theorem to formally connect the two constructions. We start with 
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some definitions. Recall that S is understood to be an arbitrary semiring 
and A an arbitrary alphabet. We then define the converging power series to 
be the set of power series over A* with coefficients in S, which is denoted by 
S{{A*)), and the diverging power series to be the set of power series over A^ 
with coefficients in (S N ) A \ which is denoted by (S N ) A * ((A")). 

For convenience we shall use function notation for power series, as we 
have for automata, so if x E S{{A*)) is a converging power series then x(w) 
is the coefficient on the word w, and if y E (S W ) A * ((A u )) is a diverging power 
series then y(p,w,-) is the coefficient on the word w for the prefix p, and 
y(j),w,i) is the i th coefficient of y(p,w, •). 

The peculiar first argument in y above is the prefix argument and it 
is used to pass information about what the prefix of the string is in order 
to ensure that the Biichi boundary condition is satisfied; the need for this 
argument will become more apparent later in this section. Obviously we will 
be interested in the case where this argument is set to zero, so we define the 



operator [-J : (S") A * {{A u )) -> S m ((A")) by [v\ (w,i) := v(e,w,i) where e is 



the empty word. Power series of the form S N ((A U )) we shall call prefix-free 
diverging power series. 

We will often be taking sums over substrings of words, so given a word 
w E A* U A w , we will use the notation, 



to mean the sum of the value of / over all strings si, . . . , such that the 
concatenation of si through is equal to w. (Note that Sjy may be infinitely 
long.) 

3.2 Semiring-Semimodule Structure 

Having established some basic notation, we move on to endowing S((A*)) 
with the standard *-semiring structure. Specifically, given the converging 
power series x,y E S{{A*)) we define addition by x + y := w h- > x(w) + y(w); 
multiplication by x • y := w i— > J2 a b =w x(a)y(b); the additive identity by 
w i— >■ 0; and the * operator by x* := X^o x *' ^ ne ^ as ^ °^ wri ich is well-defined 
if and only if x(e) = 0. These definitions make S{{A*)) into a *-semiring. 

We now turn our attention to (S' N ) yl * ((A^)). We first endow it with the 
standard form of addition and additive identity: Given diverging power series 
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x, y G (S N ) A * {{A^)) we define (x + y)(p, w, i) := x(p, w, i) + y(p, w, i) and we 
define the zero element to be (p,w,i) h- >■ 0. We next turn our attention 
to left-multiplication by elements in S((A*)), so we let I G S((A*)) be some 
converging power series. A typical approach at this point would be to define 
(I ■ x)(p, w, i) := Ylab=w K a ) x (.Pi *)> but this has a behavior that is not self- 
consistent in two senses: First, left-multiplying a word in A u by a word in A* 
has the effect of shifting the infinite word to the right to make space for the 
new prefix, whereas left-multiplying a sequence in S 11 ^ by a value in S does not 
shift the sequence at all, but only pointwise scales its elements; put another 
way, if we used the definition above then the result would be an operation 
that caused the input symbols in the word to no longer line up with the values 
in the coefficient. Second, the prefix argument of the right operand is not 
made aware of the existence of the left operand, but when we prove a Kleene 
Theorem later this will turn out to be crucial information that needs to be 
propagated. Thus, to make the operation of left-multiplication consistent in 
its action on the coefficients and the word, as well as to propagate the prefix 
information, we define it by 



(Note that although the above sum is technically over an infinite number of 
terms, it is nonetheless well-defined because for every i 6 N there are only 
finitely many a G A* such that \a\ < i.) The definition of left-multiplication 
now guarantees that the sequence coefficient on the infinite word is shifted 
by the same amount as the infinite word itself, thereby making the operation 
self-consistent. 

We shall now establish that, under the above definitions, the power series 
S{{A*)) and (S®) A * ((A u )) form a semiring-semimodule pair. Since we already 
know that S((A*)) is a *-semiring, all that is left is to prove that (5* N ) A * ((A^)) 
satisfies the semimodule laws. 

Lemma 3 (Semiring-semimodule laws). Let s,s' G S{{A*)) and let v,v' G 





ab=w 



where =4 is a right-shifting operator defined by 




(S*) A '((A U )). Then 
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1. s(s'v) = (ss')v; 

2. (s + s')v = sv + s'v; 

3. s(v + v') — sv + sv'; 
4- lv = v; 

5. Of = 0; and 

6. sO = 0. 

Before proving this Lemma, we first shall first introduce a Lemma enu- 
merating some useful properties of shifts. 

Lemma 4 (Properties of shifts). For all s E S((A*)), v E (S N ) A * {{A u ))> and 
a,b E A* we have that 

1. v =?e = v; 

2. (v^ a )^ b = v^ ba ; and 

3. (svr c (p,w,i) = £^a(a)t;=*»(p,M)- 

Proof. 1-2. Follows directly from the definition, and so the proof has been 
left as an exercise for the reader. 



(sv )^ c (p, w, i) 



I (p',w',i') H> E 

\ ab=w' 



=tc 



s(a)v^ a (p',b,i') 



(p,w,i) 



% < \c\ 

T.ab= w s(a)v^ a (pc, b,i-\c\) i> \c\ 

% < \c\ 

v^ a (pc, b, i — |c|) i> \c\ 

i < \c\ + \a\ 

v(pca, b,i — \c\ — \a\) i > \c\ + \a\ 



= E 

ab=w 

= E s ( a ) 

ab=w 

= ^8(a)'V^(p,b,i). 



ab=w 



□ 
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The proofs that the semiring-semimodule laws hold for the pair S((A*)) 
and {S n ) A *{{A")) follow straightforwardly given Lemma H] and the fact that 
(S, S ) is a semiring-semimodule pair; we present explicit proofs for the first 
two laws and leave the rest as exercises for the reader. 

Proof of Lemma [3J In this proof we make implicit use of the properties listed 
in Lemma HJ 

1. 

s(s'v) = (p,w,i) i — y s(a)(s'v) =ta (p, b, i) 

ab=w 

= (p, w,i)^J2 S ( G ) ( s'{c)v^ ac {p, d, i) J 

ab=w \cd=b / 

= (p, w,i)t-> 2J s(a)s'(c)v^ ac (p, d, i) 

acd=w 
ed=w ac=e 

= (p,w,i) ^ ^2(ss')(e)v^ e (p,d,i) 

ed=w 

= (ss')v 

2. 

(s + s ')v = (p, w, i) ^ Y ( s ( a ) + s '( )) v=to (P> i) 

ab=w 

= (p, w,i) !->• Y" ( s ( a ) v ^ a (Pi b, i) + s'(a)v^ a (p, 6, i)) 

ab=w 

= sv + s'v 

3. Same idea as in 2; details left as an exercise for the reader. 
4-6. Very simple; left as exercises for the reader. 

□ 

Before continuing, there is one last piece of structure with which we shall 
endow S((A*)) and (S N ) A * ((A^)): that of an S'-semibimodule. Because we 



19 



have already defined addition and an additive identity it suffices to define 
left and right multiplication by elements from S, so let s G S and let either 
v G S((A*)) or v G (5' N ) yl * ((A u )), and then for both cases of v let s • v — sv := 
(s • e)v and v • s — vs :=(...) (->■ v(. . . )s, where s ■ e denotes the formal power 
series s ■ e rather than recursive application of left-multiplication of e by s. 
With these definitions it is not hard to show that the semibimodule laws are 
obeyed, and so we now have that both S((A*)) and (5" N ) A * ((A w )) are both left 
and right S-semibimodules. 

3.3 Infinite Iteration 

We have finished endowing S((A*)) and (5' N ) A * {{A^)) with the structure needed 
to make it a semiring- semimodule pair, and now one important piece of struc- 
ture remains: an infinite iteration operation that allows us to construct di- 
verging power series from converging power series. We do this as follows. 
First we introduce a bit of terminology. Let the function 

^(-,;-)e(A*xA u xS((A*)))^{0,l} 

be defined such that p^(p, w, s) is 1 if pw has infinitely many prefixes in the 
support of s, and otherwise. Now for all s G S((A*/{e})), define the w 
operation by 

s u (p,w,i) := s*(w) -p u (p,w,s) (1) 

Here we finally start to see the importance of the prefix argument — it serves 
as a way to tell the iterating part of the word about what has come before 
it in order to help it correctly enforce the boundary condition. 

Given the above definition for the w operation, we conclude that S((A*)) 
and (S n ) A * {{A^)) form a *-semiring — "-semimodule pair. However, impor- 
tantly, note that they do not form a Conway pair because, in particular, the 
identity s w = s(s w ) does not hold. To see an example of this, let S := B, 
A := {a} and s := T • a; then observe that |_s w J {o^,i) = T, whereas 

[s(s")\ (a u ,i) = s(a) ■ (s")^ a (e,cO) 

that is, L s ( stJ )J ( fttJ ? ') is the same as |_s w J •) but shifted to the right and 
with the empty space filled with the zero of the semiring. 
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The fact that the Conway laws do not hold does not prevent us from 
proving results such as a Kleene Theorem, but it does mean that as we do so 
we cannot apply the theory developed in Refs. [9] and (TO] which specifically 
require the structure of a Conway ""-semiring — ^-semimodule pair. 

3.4 Rational Diverging Power Series 

We have now defined all of the operations that we need to formally define the 
rational power series. Due to the existence of an operation that maps from 
converging power series to diverging power series, namely w , we shall define 
rational power series to be a subset of the sum (or equivalently, disjoint 
union) of the converging and diverging power series. Specifically, we shall 
define fRat(S,A) to be the smallest subset of S((A*)) + (S N ) A *((A W )) such 
that 

1. for all s E S and a e A, s ■ a e 9\at(S, A); 

2. 9tat(S, A) is closed under finite sums; 

3. 9\at(S, A) is closed under left multiplication by elements of 9tat(S, A) fl 

S{(A*)}; 

4. 9Kat(S, A) is closed under left and right multiplication by elements of 
S; 

5. for every s E D\at(S,A) n S((A*/{e}}), both s* E fRat(S, A) and E 

mt{s,A). 

The rational converging and diverging power series are then defined respec- 
tively by mt*(S,A) := mi(S,A) n S{(A*)) and 9tal w (S, A) := *Rat(S, A) n 
(S n ) A *{(A»)). 

The next Lemma shows that there is a simple characterization of 9^0^(5, A). 

Lemma 5 (Characteristic representation for diverging power series). A di- 
verging power series z E (S^) A * ((A u )) is rational if and only if there exists a 
finite index set I and indexed sequences {xi,yi : yi(e) = 0}j e / C tRaf*(S,A) 
and {si}i £ i C S such that Xiyfsi = z. 

(Both the statement of this Lemma and its proof have well-known ana- 
logues in the case of unweighted words; see Theorem 3.2 in Chapter 1 of Ref. 

nsio 
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Proof. Given the I, {xi,yi} ie j, and {sj}i 6 / described in this Lemma, it is 
easy to see that x iUi s i i s rational, so the details are left as an exercise 
for the reader. 

Now let X be the (disjoint) union of D\at*(S, A) (the rational converging 
power series) and the set of diverging power series which can be written in 
the form ^2 ieI Xiyfsi where J is a finite index set and for all i G / we have 
that Xi,yi G D\ai*(S, A), y^e) = 0, and G S. Observe that: 

1. For all s G S and a G A, s ■ a G X. 

2. X is closed under finite sums. 

3. X is closed under left multiplication by elements of X n S((A*)). This 
is obvious in the case where the two elements are both in *Rat*(S, A). 
In the case where s G X n S((A*)) and v G X n (S N ) A * ((A")), then we 

See that SV = sJ2i x iVi s i = J2i( SX i)yi S i = ^2i W illi S i with w i '■= sx i- 

Since Wi is rational (as it is the product of two rational converging 
power series) and yi is rational (by assumption) we therefore conclude 
that sv & X. 

4. X is closed under left and right multiplication by elements of S. This 
is obvious if the element in X is in 9\at*(S, A) so let s G S and v G 
X n (S N ) A * Since left multiplication by s is the same as left 
multiplication by s-e then we immediately have that sv G X by point # 
3 just above. For right multiplication we see that vs = (J2i x iyf s i) s = 
J2i x iUi( s i s ) = Y J i x iUi s 'i where s- := SjS, and so G X. 

5. Given any s G X n 5((A*/{e})), both s* G X and s w G X. 

The set X therefore contains 9 c lat(S', A), and so we are done. □ 

Having defined rational power series, we now define recognizable power 
series as follows: The set of recognizable diverging power series over the 
semiring S and the alphabet A, *Rt(?(S, A) C S N {{A U )), is exactly the set 
of prefix-free diverging power series that are the behavior of some diverging 
automaton, i.e. the set such that for every x G VKtC u (S,A) there exists a 
diverging automaton A such that ||«4|| = x; analogously, we define the set of 
recognizable converging power series, denoted by D\cc*(S, A), to be the set of 
power series that are the behavior of some converging automaton. 

With this terminology we shall now state a Kleene Theorem that connects 
rational and recognizable series. 
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Theorem 1 (Kleene's Theorem for diverging power series). \%iai w (S, A)\ = 
Vice" (S, A). 

Proving this result will take up the remainder of this section. 

3.5 Known Results about Finite Power Series 

We shall build our way to the proof of this Theorem by first proving a series 
of intermediate results. Lemma |5] tells us that we can decompose diverging 
power series into a finite number of operations on converging power series, 
and conversely we can build any diverging power series using a finite num- 
ber of operations on converging power series. Thus, if we had a way to 
immediately translate a converging power series to converging automata and 
back then we would be well on our way to proving the main theorem; for- 
tunately this is exactly what we have in the form of the very well-known 
Kleene-Schiitzenberger Theorem: 

Theorem 2 (Kleene's Theorem for converging power series). D\at*(S, A) = 
mtc*(S,A). 

Proof. See Refs. [22], [20], \Fb\ and p]. □ 

We will want to connect automata together in various configurations, so 
to make this easy it would be nice if could express an arbitrary converg- 
ing automaton in a form that only has a single starting and ending point. 
Specifically, we prefer to work with automata which we shall call normalized 
automata, which satisfy the following properties: 

1. there is exactly one initial state called the start state; 

2. there is exactly one final state called the end state; 

3. the initial weight of the start state and the final weight of the end state 
are both 1; 

4. there are no edges that end on the start state; and 

5. there are no edges that start on the end state. 

(That is, if A is normalized and 1 is the start state and 2 is the end state, 
then we have that if = 6a, Ff = and M$ = = 0.) These automata 
have the property that they must reject the empty word, as the following 
Lemma shows. 
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Lemma 6. If Af is a normalized automaton then it rejects the empty word. 

Proof. This comes directly from the fact that the initial states and the final 
states have no overlap. □ 

We fortunately have a well-known result that tells us we can always as- 
sume we are working with a normalized automaton — though we shall specif- 
ically be proving a variation of this result that assumes that the automaton 
does not recognize the empty word because we do not have e-transitions in 
our definition of automata. 

Lemma 7 (Qualified existence of an equivalent normalized automaton). For 

every converging automaton A that rejects the empty word there exists a 
normalized converging automaton Af that has the same behavior as A. 

Proof. Let Af be defined as follows: Let := Q A U {1,2}, if := 6 a , 
if = 5 i2 , and 



(M A t,j,eQ 



M M - := 



A 



What we have done is create a new initial state and a new final state, and to 
the former added copies of all of the edges outgoing from the initial states in 
A, multiplying the weights on these edges by the initial weight of that state, 
and to the latter added copies of all the edges incoming to the final states in 
A, multiplying the weights on these edges by the final weight of the state. 

Now let w be an arbitrary word. First observe that if \w\ = then 
Af(w) = = A(w) because there is no overlap between the initial and final 
states. Next observe that if \w\ = 1 and W[o] = a for arbitrary a G A then 

Af(a) = I N ■ M N ' a -F N =Y^ ifM^Ff = I A ■ M A,a ■ F A = A(a). 

Finally observe that if \w\ — n > 1 then 

Af( w ) = I N ■ M N > w w . . . M-^'^f™- 1 ) • F N 

= Y, I t- M t,Q m ---<? n - 1]F > 



3 



A(w), 
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and we are done. 



□ 



Corollary 1. For every rational converging power series which has weight 
for the empty word, there exists a normalized converging automaton that 
recognizes it. 

Proof. Follows immediately from Theorem [2] and Lemma [3 □ 
3.6 Loopback Automata 

It will also be useful to work with normalized automata with the property 
that the initial and final state are the same, so we shall define a loopback 
automaton to be an automaton with the property that ij = Fi — Su — that 
is, such that there is only a single state, with initial and final weight one, 
that is the only initial and final state; we shall call this state the loopback 
state. 

It will be useful to categorize the ways that paths travel through loopback 
automata, so we say that the number of times that a path has made a circuit 
returning to the loopback state is equal to the number of trips it has made, so 
in particular a single-trip path is a path that starts and ends on the loopback 
state but does not pass through it again in between. 

There is a natural transformation called rolling that takes us from a nor- 
malized automaton to a loopback automaton: Given a normalized automaton 
M with start state 1 and end state 2, let the roll of M be the automaton L 
given by Q c := Q /V /{2}, if := if := 6 ih and 

M^:=H 3 = 1 
13 [Mff otherwise 

so that 1 is the loopback state. That is, we delete the end state, redirect all 
edges that ended on the end state to the start state, and then set the final 
weight of the start state to 1 so that the start state is now the loopback state. 

Rolling has an inverse operation called unrolling: Given a loopback au- 
tomaton C with loopback state 1, the unroll of £ is the normalized automaton 
Af, given by Q N := Q c U {2}, if := S a , Ff := 5 l2 , 

{0 j = 1 or % = 2 
Mg J =2 
Mfj otherwise 



25 



so that 1 is that start state and 2 is the end state. That is, we add a new state 
with final weight 1, redirect all the edges ending on the loopback state so 
that they now end on the new state, and set the final weight of the loopback 
state to 0, with the end result that the old loopback state is now the start 
state and the newly added state is the end state. 

Lemma 8. Rolling and unrolling are inverse operations (modulo possibly 
reordering states). 

Proof. The only parts of the automaton impacted by these transformations 
are the start state which changes to the loopback state and back again, and 
the end state which is deleted and re-added (and vice versa). The initial and 
final weights of these states are fixed since the automaton is either normalized 
or loopback, and so the inverse will always restore them to their original 
values (modulo possibly reordering states). The edges are left unchanged 
except for those that end either at the loopback state or at the end state; 
because normalized automata have no edges ending at the start state, rolling 
essentially just has the effect of interchanging a zero column in M with a 
non-zero column and then deleting the (interchanged) zero column, which is 
exactly inverted by the unrolling operation, and vice versa. □ 

The following Proposition gives us a useful specialization of Kleene's The- 
orem for the case of converging loopback automata. 

Proposition 1 (Converging loopback automata recognize * of rational power 
series). The set of power series recognized by a converging loopback automaton 
is equal to the set of power series taking the form s* where s is a rational 
power series such that s(e) = 0. 

There are a couple of preliminary results that will be useful for proving 
this Proposition. 

Lemma 9 (Sum of single-trip paths in a loopback automaton equals sum in 
the unroll). Given a loopback automaton A and a finite non-empty word w, 
the sum over all single-trip paths is equal to the sum over all viable paths for 
w in the unroll of A. 

Proof. Left as an exercise for the reader. □ 

Lemma 10 (Behavior of converging loopback automata). Let A be a con- 
verging loopback automaton and s e S((A*/{e})) the converging power series 
recognized by its unroll. Then \\A\\ = s* . 
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Proof. Let w G A*. If |iu| = then A{w) = s*(w) = 1, so assume that 
\w\ > 0. The set of all paths taken by w that start and end at the loopback 
state can be partitioned into subsets based on the number of trips that they 
take. Pick one of these subsets of paths — say, the one with the paths 
that take n trips for arbitrary < n < |iy| ■ - and then observe that this 
subset can be further subdivided into subsubsets such that every path in the 
subsubset visits the loopback state at exactly the same times, which means 
that we can express the sum over this subsubset as a product of factors where 
each factor is a sum over single-trip paths. By Lemma [9] we conclude that 
each of these factors is equal to the weight of the corresponding substring in 
s, and therefore 

i n 

A(w, i) = 5 0i + ^ Yl s ( v k) = s*(w [0 .. i] ). 

n=l vi...v n =VJ[o-.i] j=l 

□ 

Now we are ready to prove our Proposition relating power series recog- 
nized by loopback automata and the * of rational converging power series. 

Proof of Proposition LH First assume that we have a converging loopback au- 
tomaton A. Let A' be the unroll of this automaton. Applying Kleene's The- 
orem for converging automata (Theorem [2]) to A', we conclude that there 
exists a rational converging power series s that is recognized by A' which we 
know from Lemma [6] has the property that s(e) = (as A' is normalized); 
applying Lemma ITUl we conclude that ||^4|| = s* where s is rational. 

Now assume that we have a rational converging power series, s, such that 
s(e) = 0. Applying Corollary [T] (Kleene's Theorem plus normalization) we 
see that there exists a normalized automaton A' that recognizes s. Let A be 
the roll of A'; applying Lemma [TU1 we conclude that ||*4|| = s*, and so we are 
done. □ 

There is an analog of Proposition CD for the u operation. 

Proposition 2 (Diverging loopback automata recognize the w of rational 
power series). The set of prefix-free diverging power series recognized by a 
diverging loopback automaton is equal to the set of power series taking the 
form [s^l where s is a rational converging power series such that s(e) = 0. 
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There again will be a couple of preliminary results that will be useful for 
proving this Proposition. 



Lemma 11 (Condition for satisfying the Biichi boundary condition (for di- 
verging automata)). If A is a converging loopback automaton that recognizes 
the power series s, and w is an infinite word, then w visits the loopback state 
infinitely often if and only if w has infinitely many prefixes in the support of 
s. 

Proof. Because A(w[o : q) is equal to the sum of all paths of length i starting 
and ending on the loopback state, and because A(w[o :i ]) = s(w[o : q), w will 
visit the loopback state infinitely often if and only if there are infinitely 
many i such that s(ti>[o:i]) 7^ 0, i.e. if there are infinitely many prefixes in the 
support of s. □ 

Lemma 12 (Behavior of diverging loopback automata). Let A be a diverging 
loopback automaton such that its converging counterpart, A, recognizes the 
power series s* . Then \\A\\ = [s w \ . 

Proof. Let w be some infinite word. There are two cases: 

1. If w visits the loopback state infinitely often, then F A '^ = F A , so 
by Lemma U we have that A(w,i) = A(w[ 0:i y) = s*(w[o : {\). Further- 
more, because w visits the loopback state infinitely often, it therefore 
by Lemma [TT] has infinitely many prefixes in the support of s*, so 
p UJ (e,w, s*) = 1, which means that s w (e,w,i) = s*(w[ 0:i ]) = A(w,i) for 
all i. 

2. If w visits the loopback state only finitely often, then F A '^ = 0, and 
by Lemma [TT] we have that w has only finitely many prefixes in the 
support of s* and therefore A{w , i) — = s w (e, w, i) for all i. 

Thus we have shown that for all w and all i, A(w,i) = s^(e,w,i) which 
directly implies that = [s u \ , and we are done. □ 

Now we have what we need to prove Proposition[2j which we recall equates 
the behavior of diverging loopback automata and the w operation applied to 
rational converging power series. 

Proof of Proposition First let A be a diverging loopback automaton and 



A be its converging counterpart. By Proposition [TJ we know that 



A 
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for some rational converging power series s. Applying Lemma [T^l we conclude 
that H^ll = Ls w J. 

Now let s be a rational converging power series. By Proposition [I] there 
exists a converging loopback automaton A that recognizes s*; let A be the 
diverging counterpart of A. Then applying Lemma [12] we conclude that 

= L^J- □ 



3.7 Loopback Automata With Preludes 

There is another specialized kind of automaton that will prove useful: We 
say that an automaton is loopback with prelude if it has a start state with 
initial weight 1 that is the only initial state and which has no incoming edges, 
and a loopback state with final weight 1 that is the only final state; we say 
that an automaton is a loopback without prelude if it is an ordinary loopback 
automaton. We say that an automaton is loopback with or without prelude if 
it is either a loopback automaton or a loopback automaton with prelude. If 
a loopback automaton is with or without prelude, then the initial/loopback 
state is defined to be the initial state if the automaton has a prelude and the 
loopback state otherwise. 

One of the advantages of these categories is that we can break down any 
automaton in terms of a weighted sum of them, as the following Lemma 
shows. 

Lemma 13 (Decomposition into loopback automata with or without pre- 
lude). For all automata A there exists a decomposition into a weighted sum 
of automata that are all loopback with or without prelude, i.e. a tuple (K, 
{h,rk}keK, {A k } k£K ) such that \\A\\ = J2keK^k \\A k \\ r k where K is an index 
set, {l k ,r k } k£ K is an indexed set of coefficients in the underlying semiring S, 
and {A k } k( zK is an indexed set of loopback automata each of which is with or 
without prelude. 

Proof. Let A be an automaton. For all states p and q let A pq be defined 
such that Q A ™ := Q A , lf pq = 5 ip , Ff m = 5 iq , and M Apq := M A . If p = 
q then observe that A pq is a loopback automaton without prelude, and if 
p 7^ q then observe that A pq is a loopback automaton with prelude. Finally, 
observe that because the definition of ||.4.|| is homomorphic with respect to 
the elements of I A and F A (and the Buchi boundary condition does not 
break this because it ignores the actual values of I A and F A ) we therefore 
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have that ||^|| = ^Q^WM 

hj) '■= J f> and r (ij) '■= F f- 



3 



J2keK l k\\A k \\r k where K 



Q 2 , 
□ 



There are two important operations that we shall now define that allow 
preludes to be attached to and detached from loopback automata. Given a 
normalized automaton M and a loopback automaton C, M ■ C is the attach- 
ment of M to C, defined as follows: First, let 1 be the start state of Af, 2 be 
the end state of Af, and 3 be the loopback state of C Then 



i 

p N-C 



= Q*/{2} U Q c , 

= 8n, 

= 5 i3 , and 

(M{{ i,j G Q Af /{2} 



Mf 3 - C := < 



Mf 2 ieQ«/{2},j = 3 







otherwise 



That is, we take the direct sum of the two automata, merge the end state 
of A with the loopback state of B (including all edges), and set the initial 
weight of the loopback state to zero. 

Given a loopback with prelude automaton, V, the detachment of V is 
defined to be the pair of automata (A/", C) defined as follows: Let 1 be the 
start state of V and 2 be the loopback state. Then Af is the normalized 
automaton with start state 1 and end state 2 given by 



If 



Q 



V 



T'P 



M: 



■Af ._ 



Mf- otherwise ' 



that is, M is the result of deleting all edges that start on the loopback state 
of V, and taking the initial state as the start state and the loopback as the 
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end state, and C is the loopback automaton with loopback state 2 given by 



Q c 


:=Q V 




■= 8 i2 


F c 


:= F v = 


13 


■= M v 



that is, the result of setting the initial weight of the start state to and 
setting the initial weight of the loopback state to I. 

Lemma 14. Attaching and detaching preludes are inverse operations with 
respect to the power series recognized by the automaton. 

Proof. We shall start with attachment followed by detachment, so let (V, S) : = 
fj,(C) where C := A ■ B; we need to show that ||^4|| = ||D|| and \\B\\ = \\S\\. 
We first prove that \\B\\ = \\£\\. To do this, we first observe that because de- 
tachment does not delete any states, the states in S can be divided into those 
from A (excluding its loopback state) and those from B. We next observe 
that all the states from A are inaccessible because the only edges connect- 
ing states from A and states from B went from the former to the latter, so 
because the initial state is a state from B there is therefore no way to reach 
any of the states in A. Thus, we can throw the states from A out without 
altering the behavior of S, and when we do so we end up with an automaton 
that is exactly isomorphic to B and thus we have that ||£>|| = ||£||. Applying 
the same line of reasoning (with the details left as an exercise for the reader) 
we similarly see that A is isomorphic to D and so ||*4|| = Thus we are 
done with the case of attachment followed by detachment. 

Now we shall consider detachment followed by attachment, so let T> := 
B ■ C where (£>, C) = fi(A); we need to show that ||^4|| = \\T>\\. To do this, 
observe that V is just two copies of A linked by the loopback state, which all 
viable paths must pass through at some point because they must start in one 
of the copies of A and end in the other. Furthermore, the very first time that 
a path lands on the loopback state it immediately moves from the first copy 
of A to the second copy, as the first had its copy of this state deleted with 
the edges redirected to the loopback state in the second copy, and from then 
on the path is unable to return to the first copy; this is significant because 
it means that for each viable path in A there is only a single equivalent 
viable path in V, and vice versa. Thus we see that we are able merge the 
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two copies of A within T> without changing its behavior, as for any word the 
viable paths will not be affected by the merge as the states and edges will be 
the same except for the fact that they will all be in a single copy of A rather 
than having an initial prelude take place in another copy of A. Because the 
merged automaton is exactly isomorphic to A we see that ||*4.|| = \\V\\ and 
thus we are done. □ 

Loopback automata with preludes are useful because of the following fact: 

Lemma 15 (Behavior of attached automata). Let X be a normalized con- 
verging automaton recognizing the power series x, and y be a loopback di- 
verging automaton recognizing the power series \_y Lx) \ for some y where x, y G 
S {{A*)) /S {{e)). Then \\X ■ y\\ = [xy«\. 

Proof. Let Z := X ■ y. Let w be an arbitrary infinite word and % a positive 
integer (as we know that the final weight of the start state is and hence 
Z(w,0) = 0). Observe that all viable paths in Z must land on the loopback 
state at some point, and consider the set of paths for which this occurs for 
the first time at step k of the path where < k < i. We can factor the sum 
over these paths into the product of the sum over all length k paths from the 
start state to the loopback state, and the sum over all paths of length i — k 
looping through the loopback state. Because a viable path cannot access 
any of the states from y until it has landed on the loopback state for the 
first time, and because all of the states in X are present in Z and accessible 
from the start state, with the exception of the end state which has effectively 
been replaced by the loopback state which has the same incoming edges as 
the end state in X, we see that the sum in the first of the two factors is 
exactly equivalent to the sum over all viable paths in X for the substring 
wr 0: fc], which is equal to x(w\ .).]). Using similar reasoning we conclude that 
the second of the two factors is equivalent to a sum over all viable paths of 
length i — k for the substring in y and so is almost equal to something 
like \_y"\ (w[k :oo j,i—k), but in fact this value will be zero if the substring W[k :00 } 
has only finitely many prefixes in the support of y* when what we really want 
is for it to only be zero if the full string w has this property. Fortunately, 
we developed formalism for this very eventuality: [(y")^ 10 ^ 1 ] ( W[k-.oo], i) does 
exactly the right thing. Even better, summing over k we see that 
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\X-y\\ = (p,W,0^^x( W[ o :fc] )(^)^^l(P,^:oo]^) 

k=l 

oo 



fc=0 



(p,u;, «)>->> ^x(a)(^ a (p,&, 



ab=w 



[x ■ y»\ . 



(Note that we are able to extend the sum over k out to infinity because when 
k > i the shift operator on the second factor causes the term to be zero.) □ 

Lemma 16 (Behavior of diverging loopback automata with prelude). The 

set of prefix-free power series recognized by diverging loopback automata with 
prelude is equal to {[xy^l : x, y G Dlat* (A) / S ((e))} . 

Proof. First, let x and y be rational converging power series with x(e) = 
y(e) = 0. By Corollary [T] we know that there exists a normalized automa- 
ton X that recognizes x and by Proposition [2] we know that there exists a 
loopback automaton y that recognizes \_y w \ ■ By Lemma [15] we know that 
attaching these two automata forms a diverging loopback with prelude au- 
tomaton that recognizes \_xy w \ . 

Now let A be a diverging automaton with prelude, (B,C) := p>(A) the 
result of detaching the prelude, and C the result of unrolling C. By Lemma 
[T4l we know that re-attaching B and C forms an automaton with the same 
behavior as A, and by Proposition [2] and Lemma [15] we know that this 
behavior is equal to [bc^l where ||£>|| = b and ||C|| = c are converging power 
series that we know reject the empty string because of Lemma [6] (as B and 
C are normalized). Finally, by Theorem [2] we know that b are c rational. □ 

Corollary 2 (Behavior of diverging loopback automata with or without 
prelude). All diverging loopback automata with or without prelude recognize 
prefix-free diverging power series of the form [xy u \ where x and y are rational 
diverging power series such that y(e) =0. 

Proof. Follows immediately from Proposition [2] and Lemma [16] □ 
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Corollary 3. For all rational diverging power series x and y such that y(e) 
there exists an automaton that recognizes [xy w \ . 

Proof. Let 

w = e 



x'(w) :- 



x(w) otherwise 



By Lemma [T6l we know that there is an automaton A that recognizes [^'l/^J > 
by Proposition [5] we know that there is an automaton B that recognizes \_V W \ , 
and ||.A + a;(e)£|| = [x'y^ + x(e)y UJ \ = [xy u \. □ 



3.8 Proof of the Kleene Theorem 

We now have everything that we need to prove our Kleene Theorem. 

Proof of Theorem Ul First assume that we are given a rational diverging 
power series z. By Lemma [5] we know that z can be expressed in the form 
Y^=i x iUi '■ For each i we know by Corollary [3] that there exists an automaton 
that recognizes L x «Z/rJ • Let A be the direct sum of all of these automata, 
and we see that we are done. 

Now assume that we are given an automaton A. By Lemma [13] we know 
that ||^4|| can be expressed as a weighted sum of the behaviors of automata 
that are all loopback with or without prelude. Since by Corollary [2] each of 
these automata recognizes a power series of the form \_xy^\ for some rational 
x and y such that y(e) = 0, we have that the power series recognized by A 
is rational. □ 



4 Bidiverging Automata 
4.1 Preliminary Formalism 

In the previous sections we have presented automata and power series over 
the domain of infinite words. These words were uni-infinite in the sense that 
they have a definite starting point and proceed towards infinity in a single 
direction. When studying infinite systems in physics, however, we are usually 
interested in the case where there are no boundaries, which means that the 
system stretches out infinitely in all directions. For this reason, in this and 
the next section we shall proceed to extend the formalism that has been 
developed so far into the domain of biinfinite words. 
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Unlike diverging automata, bidiverging automata shall map biinfinite 
words to coefficients in 5* ZxN , where the extra Z effectively adds an addi- 
tional parameter that specifies the starting location in the word; this addi- 
tional argument is needed because unlike the case of infinite words, in the 
case of biinfinite words there is not a natural location at which to start (and 
position does not count because we can always shift the word left or right, 
making the location of position itself an arbitrary choice). As always, we 
observe that S ZxN = Z x N — > S, which means that we can use function 
notation to describe and specify elements in S ZxN . 

We now endow S ZxN with the same kind of S'-semibimodule structure with 
which we endowed S N . Specifically, given x,y G S ZxN , we define addition by 
x + y := (i,n) h-> x(i,n) + y(i,n), given s G S we define left-multiplication 
by s ■ x = sx = (i, n) t— >■ sx(i, n) and right-multiplication by x ■ s = (i, n) t— > 
x(i,n)s, and finally we define the additive identity to be (i,n) h-> 0. It is not 
hard to see that these definitions obey the semibimodule laws and so S ZxN 
is an S'-semibimodule. 

Because biinfinite words extend in two directions, we need to extend our 
terminology in order to define boundary conditions for bidiverging automata. 
First, we define a backwards path to be a path that follows transitions in the 
opposite direction while stepping backwards through a word. Second, we note 
that because there is no obvious first character in a biinfinite word we will 
generally need to specify one, so when we specify a path for a biinfinite word 
we will need to specify the starting character in the word in addition to the 
starting state in the automaton. With this grounding, we say that a biinfinite 
word w visits a state q infinitely often to the left/right if there exists some 
integer i and a sequence of strictly monotonically increasing lengths {nfc}fc e pj 
such that for every k G N the sum of all backward/forward paths of length 

starting at state q and character % of w is non-zero. Conversely, we say 
that w visits a state q only finitely often to the left/right if it does not visit 
q infinitely often to the left /right. 

There is an equivalent way of defining visiting infinitely often but it re- 
quires some additional formalism to define the reversal of words. For an 
arbitrary biinfinite word w, w is the result of reversing the word; that is, 

J v 

w m := w\-i\. The result of applying a reversal operation to an infinite sub- 
string operation is defined as W[ a -.b} '■= w [-b-.-a]- (This definition bypasses the 
need for us to construct a formalism of backwards infinite words.) For an 
arbitrary finite word w, := W[\ w \-i-i]. For an arbitrary biinfinite power 



35 



series s, *s*(w,i,n) = s(W\ —(i + n),n). (Confirming that s = s is left as 
an exercise for the reader.) Given this, we have the following Lemma: 

Lemma 17 (Alternative definitions for infinite visits). Given an automaton 
A, a biinfinite word w visits a state q infinitely often to the right if and only if 
there exists an integer i such that iyr i:oo i visits q infinitely often, and w visits 
a state q infinitely often to the left if and only if there exists an integer j such 
that luj-ooy] visits q infinitely often in the automaton obtained by reversing 
the edges of A. 

Proof. Left as an exercise for the reader. □ 



4.2 Bidiverging Automata Defined 

As with diverging automata we shall use function notation as a convenient 
means of defining the behavior, which we do as follows: 

A(w,i,n) := I A ' [w] ■ (jjM A ' w i^ • F A[w] 



where 



and 



T A,[w] J if w visits state % infinitely often to the left 

otherwise 

-,A,[w] J Ff w visits state % infinitely often to the right 

otherwise 



Note that A can equivalently be interpreted as the sum of all viable paths 
(taking into account the modified initial and final states for the boundary 
condition) for the substring wu.^ n \. 

These automata have the property that the power series they recognize 
are shift invariant in the sense demonstrated in the following Lemma: 

Lemma 18 (Behavior shift invariance). For any bidiverging automaton A 
and biinfinite word w let w^ k := w^k}- Then for all i,j G Z and k,n G N 
we have that A(w, i, n) = A(w~* k , i + k,n). 

Proof. Follows directly from the definition. □ 
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As with converging and diverging automata, the behavior of bidiverging 
automata is a homomorphism. 

Lemma 19 (Behavior is homomorphism (for bidiverging automata)). Given 
bidiverging automata A and B over some semiring S and scalar values a, (5, 
7, 5 eS we have that \\aAy + pB6\\ = a \\A\\ 7 + p \\B\\ 5 

Proof. Follows straightforwardly from the definitions just as it did for con- 
verging and diverging automata, so the proof has been left as an exercise for 
the reader. □ 



5 Bidiverging Power Series 

5.1 Bidiverging Power Series Defined 

In the previous section we introduced bidiverging automata, which are a two- 
way generalization of diverging automata. In this section we shall likewise 
introduce bidiverging power series, which are a two-way generalization of 
diverging power series. 

We again let S be a semiring and A be an alphabet. We then define 
S ZxN (^A<>y) to be the set of all power series over A* with coefficients in S ZxN , 
which we shall call bidiverging power series. As before, we shall use function 
notation, so if v G S ZxN ((^)) , w G A 1 *, i G Z, and n G N, then v(w,i,n) is 
equal to position (i, n) of the coefficient on the word w. Note that bidiverging 
power series, unlike diverging power series, do not have a prefix argument. 

We endow bidiverging power series with an S'-semibimodule structure 
that is consistent with the ^-semibimodule structure with which we endowed 
S N : for all x,y G S ZxN (^A^ we have that addition is given by x + y := 
(w, i, n) 1 — y x(w, i, n)+y(w, i, n), for all s G S we have that left-multiplication 
is given by sx := (w,i,n) h-> sx(w,i,n) and that right multiplication is 
given by xs := (w,i,n) h-> x(w,i,n)s, and the additive identity is given by 
(w,i,n) !->■ 0. It is easy to see that the semibimodule laws hold, making 
S« xN ((A^)) an g-semibimodule. 

5.2 Rational Bidiverging Power Series 

There are two basic ways in which we shall construct bidiverging power series 
from other kinds of power series. The first way to build a bidiverging power 
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series from another kind of power series is infinite iteration, denoted by ^, 
which is defined as follows: Let s be a converging power series which rejects 
the empty word. Then 

s c (w,i,n) := s*(w [i:i+n] ) ■ p ( (w,s*) 

where p^iw^x) is 1 if there exists an integer i such that w\ i:oo ] has infinitely 
many prefixes in the support of x and an integer j such that w^^j] has 
infinitely many prefixes in the support of ^F", and otherwise. 

The second way to build a bidiverging power series is conjoining, denoted 
by which takes three converging power series and forms a bidiverging 

power series as follows: Let x,m,y G S{(A*/{e})). Then the conjoin of x, m 
and y is given by, 

(x * m -k y){w,i,n) := 

[x*(Wii:i+j]) ■ P* X *) 

3=0 k=j 

■ m(w\i+y.i+j+k]) ■ [y*(w [i+j+k:i+n] ) ■ p* (w,y*)] 

where p*(w,s) is defined to be 1 if there exists an i such that W[ i:oo j has 
infinitely many prefixes in the support of s, and otherwise. 

Having defined these two ways of building bidiverging power series from 
other power series, we shall now define rational bidiverging power series, 
9\ai^(S, A), as the smallest set such that 

1. 9tatf(S, A) is closed under finite sums; 

2. fRatf(S,A) is closed under left- and right-multiplication by elements 
from S; 

3. for all z G 9\at* (S , A) / S ((e)) , z< e 9iat c (S, A); and 

4. for all x,y,me M*(S,A)/S((e)), x*m*y G <Kat f (S, A). 

As with diverging power series, there is a simple characteristic form for 
this set, as shown in the following Lemma. 

Lemma 20 (Characteristic representation for bidiverging power series). A 
bidiverging power series p G 5* ZxN ((^)) is rational if and only if there exist 
finite index sets I and J and sequences {ai,bi} ieI C S, {xi,yi,mi} ie i C 
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Mat* (S, A) /S ((e)), {lj,rj} je j C S, and {zj} jeJ C Mat* (S, A)/ S ((e)) such 
that 

p = S ^2a i {x i -km i -k y,j)bi + ^ Ijzjrj. 
iei jej 

Proof. Given the /, J, {a u bi} i£l ,{xi, m h yi} ie i, {lj,rj} je j, and {zj} jeJ de- 
scribed in this Lemma, it is easy to see that Yliei a,i(xi~km i ~ky i )b i + J2j(zj lj z j r j 
is rational, so the details are left as an exercise for the reader. 

Now let X be the set of bidiverging power series which can be written in 
the form described in the Lemma. Observe that: 

1. X is closed under finite sums. 

2. Because each term has a left- and right-accumulator for the semiring 
values it is easy to see that for all l,r G S and v G X we have that 
Ivr e X. 

3. For all z e ttat*(S,A)/S((e)), z ( G X. 

4. For all x,m,ze Vtat* (S , A) / S ((e)) , x-km-ky G X. 

The set X therefore contains llHat^S', A), and so we are done. □ 

The set of recognizable bidiverging power series, fRtc^(S, A), is equal to 
the set of power series that are the behavior of some bidiverging automaton, 
and as with diverging automata and power series, bidiverging automata and 
power series are related by a Kleene Theorem. 

Theorem 3 (Kleene's Theorem for bidiverging power series). 

<Rat<(S,A) =^ec c (5,A) 

5.3 Bridge Automata 

Before proving Kleene's Theorem we will first prove some Lemmas. As was 
the case with diverging automata, it will prove useful to start by analyzing 
some special cases. The first we call a bridge automaton, which is defined 
to be an automaton such that there is exactly one initial state and exactly 
one final state which are not the same state and both have weight 1. One of 
the reasons why these automata are special is because they can be formed by 
conjoining, which we now define. Let X, M., and y be normalized automata; 
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then the conjoin of X, Ai, and y is an automaton denoted by X * Ai * y 
which is defined as follows. First, let A and B denote respectively the roll 
of X and y. Furthermore let 1 be the loopback state of A, 2/3 be the 
initial/final state of Ai, and 4 be the loopback state of B. Then Q x * M * y := 
(Q A UQ M U Q B )/{2, 3}, If * M * y := S a , F** M * y := 5 l4 , and 



M 



x*M±y ._ 



(M A 






i,jeQ M /{2,3} 




i,jeQ B 




t = l,jeQ M /{2,3} 




j G Q M / {2, 3}, j = 4 




otherwise. 



That is, X and y are rolled and merged with A4, with the initial and final 
states from M. being deleted and their edges redirected to respectively start 
on the initial state and end on the final state. 

The next Lemma shows that conjoining also has the nice property that 
the behavior of the conjoin is the conjoin of the behaviors. 

Lemma 21 (Behavior of conjoin is conjoin of behavior). Let X , Ai, and y 
be normalized automata. Then \\X * J\4 *y\\ = \\X\\ * \\Ai\\ * \\y\\. 

Proof. By Kleene's Theorem [2] and Lemma E] we know that there exist 
x,m,y E S{(A*/{e}}) such that \\X\\ = x, \\M\\ = m, and \\y\\ = y. Let 
A and B be the respective rolls of X and 3^- By Lemma [10] we know that 
= x* and ||£>|| = y*. 
Now let w be a biinfinite word, i an integer, and n a natural number, and 
let us consider the value of || X * M. * y\\ (w, i, n). Observe that by construc- 
tion every viable path has to start on the loopback state in A and from there 
pass through states only in A until it lands on the loopback state of A for 
the last time, after which it moves into Ai and passes through states only in 
there until it eventually it lands on the loopback state of B, after which it 
passes only through states in B until it ends on the loopback state of B. Now 
consider the set of all paths that land on the loopback state of A for the last 
time on the j th step and on the loopback state of B for the first time after 
step j on the k th step. Because all paths in this set land on the same steps at 
the loopback state in A for the last time and at the loopback state in B for 
the first time after the last time landing on the loopback state in A, we can 
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factor the sum over all these paths into the product of sums over paths in A, 
Ai, and B for the respective words W[ i:i+ j], W[ i+ j ]i+ j + k] and W[i + j+k:i+n]\ since 
these sums are equal to the value of the behavior at the word for these three 
automata we therefore have that these three factors are equal to respectively 
x*(w[i. A+j] ), m(w[i +j:i+j+k ]), y*(w [i+j+k . A+n] ). 

Now, note that the left boundary condition will be satisfied if and only 
there exists an i such that ferZ^j visits the loopback state in the reversal of 
A infinitely often. By Lemma [TT1 we know that such an % will exist if and only 
if W[-oo-i] has infinitely many suffixes in A = x* . Thus we conclude that 

a factor of p* ^"W", x*^j suffices to ensure that the left boundary condition is 

held, and by similar reasoning we conclude that a factor of p*(w,y*) suffices 
to ensure that the right boundary condition is held. 

Putting all of these results together and summing over j and k, we see 
that 



n n 



\\X*M *y\\ (w,i,n) := 
n n 

[ X *( W [r-i+j]) ■ P 
3=0 k=j 

■ m(w [i+j:i+j+k] ) ■ [y*{w [l+j+k .. i+ri] ) ■ p* (w,y*)] 
= (x-km * y)(w, i, n) 
= ^X\\*\\M\\*\\y\\)(w,i,n). 

□ 

We now need to define a quasi-inverse operation to conjoining, which we 
shall call disjoining. Given a bridge automaton A, the disjoin of A is a triplet 
of normalized automata (X,Ai,y). Let 1 be the start state of A and 2 be 
the end state. Then: 

• X is the result of making the initial state in A the loopback state and 
then unrolling; 

• y is the result of deleting the initial state in A, making the final state 
the loopback state, and then unrolling; and 

• Mis given by Q M := Q A , if 4 := 8 a , F t M := 5 i2 , and 



M- 



M ._ 



i = 2 or j 
M-f otherwise 
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that is, the result of deleting the incoming edges on the initial state of 
A and the outgoing edges on the final state. 

The sense in which conjoining is a quasi-inverse operation is given in the 
following Lemma. 

Lemma 22 (Disjoin is quasi-inverse of conjoin). Given a bridge automaton 
A, and letting (X,M.,y) be equal to the disjoin of A, we have that \\A\\ = 

\\x*M*y\\ =■■ \\B\\ . 

Proof. We find ourselves essentially in the same setting as Lemma [HJ except 
that now B contains three copies of A connected by two shared states instead 
of two copies connected by one shared state. Observe that every viable path 
must start in the first copy (i.e., the one from X), end in the third copy 
(i.e., the one from y), and pass through the second copy (i.e., the one from 
Ai) on the way between the two. In particular, observe that only the first 
copy allows a path to land on the initial state from A as the second copy 
has all edges leading away from this state and in the third copy it is not 
present at all; thus we conclude that every path moves from the first to the 
second copy on the step following the last time it lands on the initial state. 
From there, observe that every path moves from the second copy to the third 
copy the very first time that it lands on the final state from A (that is, the 
first time after the last time it had landed on the initial state). Because of 
these facts, we conclude that for each viable path in A there is only a single 
equivalent viable path in V, and vice versa. Thus we conclude that we can 
merge the three copies of A into a single copy while preserving the behavior, 
and because this copy is isomorphic to A it obviously has the same behavior 
as A and so \\B\\ = \\A\\. □ 

Now we see the significance of bridge automata. 

Lemma 23 (Behavior of bridge automata is the conjoin of rational power 
series). All power series recognized by bidiverging bridge automata take the 
form x -km-ky for some x,m,y G Dlat* (S, A) / S({e)) . 

Proof. Suppose we are given an automaton A with the stated properties. Let 
(X,J\4,y) be the disjoin of A, which recall implies that X, M. and y are 
all normalized. By Kleene's Theorem (Theorem [2j and Lemma [6] we know 
that there exist power series x,m,y G Vlat* (S, A)/ S((e)) such that \\X\\ = x, 
\\M\\ =m, and \\y\\ = y. By Lemma [22] we know that \\A\\ = \\X*M*y\\, 
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and by Lemma I2T1 we know that \\X* M. *y\\ = \\X\\-k\\M.\\~k\\y\\ = x*m*y, 
and since x, m and y were shown to be rational we are done. □ 

5.4 Loopback Automata 

The next special form of automaton we shall analyze in the context of bidi- 
verging automata is the loopback automaton. The most important result we 
shall prove is an analogue to Proposition [2j 

Proposition 3 (Bidiverging loopback automata recognize the of ratio- 
nal power series). The set of power series recognized by bidiverging loopback 
automata is equal to {z*> : z G Dlat*(S, A)/S((e))}. 

First, we need a couple of preliminary Lemmas, analogous to Lemmas [TT] 
and[H 

Lemma 24 (Condition for satisfying the Biichi boundary condition (for 
bidiverging automata)). If A is a converging loopback automaton that rec- 
ognizes the converging power series s, and w is a biinfinite word, then w 
visits the loopback state infinitely often to the left and the right if and only if 
p c (w,s) = 1. 

Proof. We start by analyzing the infinite visits to the right. Observe that 
by Lemma [17] we know that w visits the loopback state infinitely often to 
the right if and only if there exists an integer i such that W[i :00 ] visits the 
loopback state infinitely often, and from Lemma (TT] we know that such an 
integer exists if and only if Wu :oo ] has infinitely many prefixes in s. Combining 
these results, we conclude that w visits the loopback state infinitely often to 
the right if and only if there exists an integer i such that W\i :oo ] has infinitely 
many prefixes in s. 

We next analyze the infinite visits to the left. By reversing the automaton 
and applying the same argument, we see that w visits the loopback state 
infinitely often to the left if and only if there exists a j such that w^oo-a has 
infinitely many prefixes in s . 

Putting these two results together, we see that w visits the loopback state 
infinitely often to the left and the right if and only if there exist integers % and 
j such that tOfcoo] has infinitely many prefixes in s and w^^.^ has infinitely 
many prefixes in t?", and since p ( '(w, s) — 1 if and only if integers with the 
just described properties of i and j exist we are done. □ 
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Lemma 25 (Behavior of bidiverging loopback automata). Let A be a bidi- 
verging loopback automaton such that its converging counterpart, A, recog- 
nizes the power series s* . Then \\A\\ = s s . 

Proof. By Lemma I2"4l we know that p^(w,s*) = 1 if and only if the loopback 
state is visited infinitely often to the left and the right, so because the sum 
over viable paths is the same in both A and A we have that 

\\A\\ = (w, i, n) H> s*(w[i: i+n ]) ■ p ( (w, s*) = s 5 . 

□ 

We are now ready to prove that loopback automata recognize the "> of 
rational converging power series. 

Proof of Proposition O First assume that we have been given a diverging 
loopback automaton A. By Proposition [1] we know that its converging coun- 
terpart recognizes s* where s is a rational converging power series such that 
s(e) = 0, and by Lemma 1251 we have that ||^4|| = s^. 

Now assume that we have been given a rational converging power series 
s such that s(e) = 0. By Proposition [1] we know that there exists a diverging 
loopback automaton A whose converging counterpart A recognizes s*, and 
by Lemma [25] we have that ||.4.|| = s*>. □ 

5.5 Proof of the Kleene Theorem 

We are almost ready to prove our Kleene theorem, but there is one Lemma 
left. 

Lemma 26 (Decomposition into bridge automata and loopback automata). 
For all bidiverging automata A there exists a decomposition into a weighted 
sum of bridge automata and loopback automata, i.e. a tuple (K, {l k ,r k } ke x, 
{A k } k&K ) such that \\A\\ = J2 keK h \\A k \\ r k where K is an index set, {l k , r k } k&K 
is an indexed set of coefficients in the underlying semiring S, and {A k } k£ K is 
an indexed set of automata each of which is a bridge automaton or a loopback 
automaton. 

Proof. This proof has the exact same form as Lemma [13l but with the role 
of loopback automata with prelude replaced by bridge automata. Given this, 
the details have been left as an exercise for the reader. □ 
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Finally we are ready to prove our Kleene Theorem for bidiverging power 
series. 

Proof of Theorem^ First, assume we have been given an automaton A. By 
Lemma [23 we know that there exists a decomposition of A into a weighted 
sum of bridge automata and loopback automata. By Lemma [23] and Propo- 
sition [3J we know that both kinds of automata have rational behaviors, so 
because a weighted sum of rational bidiverging power series is also rational 
we have that \\A\\ G Mat s (S, A). 

Now assume that we have been given a rational bidiverging power series 

p. By Lemma I2U1 we know that p = a i( x j* m i*2/i)^+X]jeJ ^i z< ] r i ^ or some 
sequence of rational diverging power series {xj, m$, jjiji^i C 9iat*(S, A) /S((e)), 
some sequences of semiring elements {cti,bi}i e i C S and C S, and 

some sequence of rational converging power series {zj}j e j C Dlat* (S, A) / S((e)) . 
By Corollary [1] we know that for every % G I there exist normalized con- 
verging automata X iy M. i and such that ||A£|| = x i} = ru^ and 
||3^i|| = Vi. and by Lemma I2T1 we know that \\Xi* = * H^ill * 
||3^|| = Xi-k mi-k yi. By Proposition [3J we know that for every j there ex- 
ists a bidiverging automaton, Zj, that recognizes the power series zj. Let 
A := J2iei a i(^i * Mi * y^bi + h^3 r j^ an d b ecause by Lemma [19] the 
behavior operation is a homomorphism, we see that ||A|| = p. 

□ 



6 Application: Quantum Simulation 

6.1 Background for Finite Systems 

In the previous sections we have presented formalisms for diverging and bidi- 
verging automata, but we have not shown how they can be applied to model 
relevant systems in quantum physics. We shall do so in this section. First, 
though, we need to introduce some basic concepts from (discrete^) quantum 
physics. 

We shall define a quantum system to be a finite (for now) set of configu- 
rations A. At any time it will be in a state, usually denoted by ip, which is 
a superposition of these configurations, by which we mean that ip G C{(A)). 

3 It is possible to apply similar ideas to systems with continuous degrees of freedom — 
see Ref. [25] for an example — but that is outside the scope of this discussion. 
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Before proceeding, it is useful to define what it means to take the dual 
of ip- The dual operation for quantum states is denoted t G <C((A)) — > 
(C((A)) — > C) and is defined by (X)c»ai)t = X] c i a « rl where q G C, ^ e A, 
and for all a^a^ G A we have aj 1 (ak) = 5jk- Given the dual operation, we 
define the normalization of ip as \ip\ 2 := if)^(i/)). The quantity \ip(a)\ 2 /\ip\ 2 
gives the probability of observing the configuration a if the system is mea- 
sured (in the A basis) El After measurement, the state of the system is said 
to have been collapsed into configuration a as at that point ip = a. 

Part of what makes quantum mechanics interesting is that there is more 
than one way to measure a quantum system. For example, consider a single 
particle with a quantum spin which can be in the 'up' configuration along 
the Z-axis, denoted by f, or in the 'down' configuration along the Z-axis, 
denoted by I, so that A := {t, I}- Possible states of this system include t> 
t + 4, 4gt —i-l and so on. Measuring the spin of the system along the Z- 
axis will collapse the state of the system into either f or I, but interestingly 
measuring the system along the X-axis will collapse the state of the system 
into either "| + J, or f — \,, and measuring the system along the Y-axis will 
collapse the state of the system into either j" + i\. or f — i 4-0 

Now, when we measure an observable quantity of the system we do not 
usually get an exact reading of the state of the system but rather there is some 
dial that we read that gives us a real number from which we can infer partial 
or total information about the state of the system. For example, when we 
measure the spin of a particle we might do so by sending it through a special 
magnetic field that deflects it upward or downward based on its spin, and 
then measure by how much it is deflected, with +1 corresponding to 'up' and 
— 1 corresponding to 'down'. For this reason, an observable quantity consists 
of two pieces of information: the indexed set of possible states to which the 

4 Sometimes when dealing with hnite systems it is simply assumed that the state is 
normalized and so there is a burden to ensure that all manipulations of the state preserve 
this property, but we take the other common approach of simply not worrying about the 
normalization as it can always be accounted for at the end of the computation. 

5 For the interested reader we mention in passing that this is an example of the un- 
certainty principle in action: By measuring along the X-axis we collapse the state of the 
system into the form t ± j, which causes f and -J. to have equal amplitude and hence to 
have equal probabilities if we measure along the Z axis. So although we now know the 
spin along the X-axis, we have maximally prevented ourselves from knowing what we will 
get if we measure the spin along the Z-axis. We get an analogous effect if we measure 
along other axes, and hence we conclude that exact knowledge of one axis ensures maximal 
uncertainties of the other axes. 
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system might be collapsed, {^ijie/, and, for each i G i, the value Aj G K that 
will be observed if the system collapses into that statejj Both of these pieces 
of information can be stored within a single operator O := Xipiip} that is 
an endomorphism (linear operator) over the configuration space C((A)) with 
the property that for every i G / we have that O(V^) = Xi-ipi — that is, O has 
an eigendecomposition into eigenvalues, {Aj} ig /, and associated eigenvectors 
or eigenstates, {ipi}iei- It is easy to see that for all states x and y we have 
that (x* o 0)(y)* = (y^ o 0)(x), and so O is self-adjoint. We shall say that O 
lives in the space C((A — > A)) where, for all a« — > aj G (A — > A) and at G A 
we have that (aj — > aj)(ak) = cij5ik- 

With the observable O (by which we shall mean the operator representa- 
tion of the observable described above) in hand, and given an arbitrary state 
ip, there is also another useful piece of information we can calculate which 
is the expected value of O, given by (^ f o 0)(4>); this gives us the average 
value that we would expect to see over repeated experiments with the sys- 
tem reinitialized to ip each time. Part of the reason that this quantity is so 
important is because we do not always know the eigenvalue decomposition of 
O and so, for example, randomly generating many states and computing the 
expected value of O can provide estimates of the maximum and minimum 
values of the observable. 

Three observables appear so often that they are worth mentioning here; 
they are the three Pauli spin matrices, X, Y, and Z, which correspond to the 
observables for the spin along the respective X-, Y-, and Z-axes. Recalling 
that the eigenstates of X were (after normalizing) ^(t + 4) an d ^75 (T — 4)> 

we see that X = |([t + I] ^ [t + I]) - |([t - I] -> [t - I]) = (t-4 
) + (4,— 7-t)- Following a similar process for the other operators we obtain 
Y = -i(t-4) - z(|^t) and Z = (t^t) - (>k4)- 

Another observable that is very important is the hamiltonian, as it both 
defines the energy observable of the systems and also completely specifies 
how a state evolves over time in the following way: If H = Eiipiipj is the 
hamiltonian of a system (where the values Ei are the energies) then [/(At) = 
Si e~ lEi/lt il)iil)l is the endomorphism that takes an arbitrary starting state 

6 If multiple configurations have the same value then if that value is measured the 
system has collapsed into some superposition of these configurations. 

7 In principle the first factor should be e - l ( E i/ h ) At where 1/h is effectively a unit conver- 
sion factor from energy to temporal frequency, but it proves convenient in many contexts 
(such as this one) to simply assume that we are working in a system of units such that 
Ti=l. 
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and maps it to the state of the system after At time has passed. (Note that 
this operator is independent of the starting time.) 

6.2 Application to Biinfinite Systems 

Up to now we have assumed that we are working with a finite system, but 
it is often incredibly useful to study systems that are infinite in extent. The 
reason for this is that it gets rid of the boundaries on the sides of the system 
by making them be infinitely far away. This allows us to study the bulk 
behavior of the system without having the boundary effects mixed in. This 
is useful not only because it makes it easier to understand what is going 
on by isolating out one of the kinds of behavior, but also because real-life 
systems tend to be almost 'infinitely large' given that they have on the order 
of Avogadro's number of particles (~ 6.02 x 10 23 ) so that the vast majority 
of the material behaves as if it were in an infinitely large system. 

Thus, we now say that the set A contains the set of configurations not 
for the full system, but only for a single site of the system. At this point 
we are going to assume that we are studying a system in a single dimen- 
sion. Obviously this is being done right now because it connects with the 
formalism presented in this paper, but it is also the case that the study of 
one- dimensional systems in physics is quite common. There are a couple of 
reasons for this. First, systems with multiple dimensions are still very diffi- 
cult, and so one- dimensional versions of a system give an approach that may 
glean some useful information, or at the very least provide a useful eventual 
contrast that shows how phenomena change when the number of dimensions 
increase. Second, there are many real-life systems that can be treated as 
being one-dimensional for various reasons, such as narrow tubes where the 
interactions not along the axis are negligible. 

So given that we have a one- dimensional biinfinite system, its configu- 
rations are given by A^, and naively its state space would be C((A<)), but 
the problem with this space is that computing the normalization and the 
expected value of an observable are in general not possible as the sums won't 
converge. Thus, we must find a subspace within this space such that we can 
make them converge. One possibility is to work within a von Neumann ten- 
sor product space (also called an "incomplete" tensor product space) which 
is essentially the maximal subspace of C^A^ that is a Hilbert space (see 
Ref. [27J ). Unfortunately this subspace is restrictive and does not allow us 
to use many basic but important operators such as I * Z * I, which is used 
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to define a magnetic field or to measure the magnetization. 

Fortunately this entire paper has described an alternative solution to this 
problem — rational bidiverging power series. That is, we let the state space 
live in D\at c (C,A) C C ZxN «^)). Because of this, all states have equiv- 
alent representations as automata which give us efficient ways to compute 
representations of the normalization and expected values. 

To define how to calculate these values, we first shall first define how 
transducers work. First, let 2lut(A) be the set of bidiverging automata over 
A and C. Now let O G Qiut(A ->■ B) and A G 2lut(A) be bidiverging 
automata. Then 0(A) is given by 



o(A) 



Q 



t O{A) 

F 0(A) 

M 0(A),b 
(i,j),(k,l) 



= Q°xQ A 
= I°I A 

» 3 

= F°F A 

1 3 



J2 M ^ bM i 

aeA 



A,a 
Im ' 



Now that we have transducers, we define the dual operation as simply 
mapping every a G A to G {0}, i.e. so that if A G 2lut(A) then A 1 " G 
2lut(A ->■ {0}). In particular, if B G 2lut(A) then C := A ] {B) G 2lut({0}). 
Thus we see that the dual transducer of an automaton has the effect of 
essentially mapping all words to scalars, just as the finite definition did. 
Note that the input language of C is {0 5 }, which both consists of only a 
single string and is completely invariant under shifts, so we can effectively 
ignore the word and position arguments and treat C as a map from natural 
numbers to complex numbers, i.e. C(w,i,n) = C(n) for all w G {0 5 } and 
% G Z. 

Because bidiverging automata directly correspond to bidiverging power 
series, all of the operations we have just defined can be lifted to act on 
bidiverging power series. Specifically, for any power series a, b G <Kat c (C,A) 
we define a'(fe) : N — > C by a' (b) :=m4 A*(B)(n) where ||^4|| = a and ||£>|| = 
b, and if o G 91at c (C, A ->■ B) then we define o(a) := ||C?(-4)|| where ||C|| = o. 
These operations automatically give us a well-defined normalization, but for 
expected values it is useful to clarify that the ratio of the two maps should 
be taken pointwise — that is, given an endomorphism o G 9fat^(C, A — > A), 
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the expected value of ip e JKat^(C, A) is defined to be 

(^t 00 ) W ( n ) 

At this point it might not be obvious how much we have gained. It is true 
that we have found a subset of bidiverging power series where normalizations 
and expected values are well-defined, but in the process we have paid three 
prices: first, we have required that our operators live in 9iatf(C,A — > A), 
second, our observable values are now sequences rather than real values, 
and third, our states are forced to live in a restricted space that will in 
general not contain the actual physical states. Fortunately, the first price 
turns out to be fairly low one because most operators that physicists care 
about turn out to be exactly representable as bidiverging power series over 
endomorphisms. For example, the average magnetization of a system is given 
by the sum over terms where every term has the Z operator at one site and 
the identity (/) at the rest so that the bidiverging power series takes the 
form I -k Z -k I . (This might look like it only generates a single term, but 
remember that whenever the coefficient is evaluated it effectively sums over 
all possible shifts.) Furthermore, those interactions that can't be represented 
exactly can usually be approximated fairly well, as we shall see later. 

The second seeming price — that of having a sequence in the place of a 
scalar value — is actually a boon instead of a bane. Consider, for example, 
the energy of an infinite system. Obviously the total energy of the system is 
going to be infinite in general, but knowing this is not particularly helpful. 
What is helpful instead is knowing how the energy grows with the size of the 
system, and this is exactly the information that is encoded in the sequence! 
That is, because the value of the expected value at position n is exactly equal 
to sum over all paths that have length n, it naturally has the interpretation 
as the expected value of any collection of n contiguous sites of the system 
in the absence of boundary effects, or alternatively as the component of the 
expected value in a system of size n that is due to bulk behavior rather than 
boundary effects. 

For an example of how the expected value works in this way, consider 
a quantum system with A = {t, 4} in the state ip —r- It is left as an 
exercise for the reader to show that = 1. Now let O := I * Z * I be 
the magnetization observable discussed earlier. The automaton for O is O 
which is given by Q° : = {1,2}, if := 5 n , F? := S a , and M® = I ■ <% + 
Z ■ 5n8j2- Because the state is normalized, the expected value is given by 
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(V> f o 0)(t/j), and it is left as an exercise for the reader to show that this 
is equal to the behavior of an automaton £ given by Q £ = Q°, I s = 1°, 
p£ _ po ^ an( j j^s _ -y _ a nd that based on this, the expected value 

of O with respect to ip is n \— > n. This makes perfect sense because every 
time a spin pointing up has been added to the system we would expect 
the magnetization to grow by a single unit. The main physical quantities 
of interest — including magnetization and energy — tend to be extensive 
quantities, which means that they grow linearly with the size of the system 
and hence have an interpretation as an energy or magnetization per site. In 
general, though, the result of an expected value will not be linear, but it is 
restricted to have the following general form, 

£ je/ A™ ■ poly» + £, e 

| y 

Ei'ei' K -poly</(n) + E,' e ji Cj'5 n ji 

where poly^n) denotes some polynomial in n. This follows from the fact 
that every matrix — and therefore the transition matrix for the expected 
value automaton in particular — is similar to a matrix in Jordan Normal 
Form, and it can be shown that raising a matrix in Jordan Normal Form 
to an integer power n results in a new matrix where every component has 
the form £ ie/ A? ■ poly^n) + J2jeJ c ^nj for some I, {(A;, polyj}^/, J, and 
{cj}j £ j. For the details, see pages 385 and 386 of Ref. [13], and specifically 
let p(X) = A n to obtain the above result. 

Finally, the third price — the fact that we are living in a restricted space 
- is not a deal breaker as long as the states in this space provide sufficiently 
good approximations to the physical states of interest, and fortunately it 
turns out that they do in practice; see the end of this section for an example 
of a simulation that illustrates this. 

6.3 Simulation Methodology 

We have now established that bidiverging power series provide a means of 
approximating quantum states in a manner that has well-defined and useful 
values for expected values of observables, but this fact would be uninteresting 
if there were not ways to find sufficiently good approximations of quantum 
states of interest. Fortunately, there are such ways, and they take advantage 
of the fact that when we study quantum systems we are often most interested 
in the ground state or states — that is, the lowest energy state or states - 
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and possibly the excited states — that is, those states just above the ground 
state energy. The reason for the focus on these states is that they tend 
to have the most interesting behavior because as the energy grows higher 
the system acts increasingly like a classical system with classical properties 
rather than a quantum system with quantum properties. 

So given that we are interested in the lowest energy states, a natural 
approach is to start by finding the ground states and then working up from 
there. To find a ground state, we take advantage of the fact that the expected 
value of the energy will never be less than the ground state energy, and 
furthermore the lower the expected value is the closer we are to the ground 
state energy and thus hopefully (but not necessarily) a ground state. Thus, 
a heuristic that turns out to be effective in practice (although it is of course 
not guaranteed to worlj^l) is to start by making some ansatz for the ground 
state — say, that it takes the form of a bidiverging power series — and then 
to adjust the free parameters to minimize the expected value of the energy. 
Once the energy has been minimized we take the resulting state to be a 
ground state (or at least, a sufficiently good approximation of it) and from 
there one can in principle find the next lowest state (possibly another ground 
state) by performing the same procedure but with a constraint that the new 
state must be orthogonal to the old state. This method is known in the field 
of physics as the variational method. 

For bidiverging power series, there are a couple of basic variational ap- 
proaches one can use. First, there is the imaginary time evolution approach 
(see Refs. [26] and [IB]). To understand how this works, it is useful to first 
recall that the operator that evolves a given state forward in time by At units 
is given by U(At) = J2i e ~ iEtAt ^l where the E i 

are the energy eigenvalues 
and the tpi are the associated energy eigenstates. Now observe what happens 
if we feed an imaginary time into U: U'(At) := U{iAt) = '^2 i e~ EiAt il>iijj i . 
The new function U' has the effect of causing each energy eigenstate compo- 
nent of the state to decay at a rate exponentially proportional to its energy, 
so by evolving a state arbitrarily far forward in imaginary time the propor- 

8 One reason why this might not work is because the structure of the energy eigenstates 
is such that there are states that have energy almost equal to the ground state energy but 
which are not close to a ground state. This does not tend to cause problems in practice, 
but interestingly it does cause problems for systems that are designed such that, say, the 
ground state encodes the solution of an NP-complete problem, which is why engineering 
such systems then cooling them down as close as possible to absolute zero does not actually 
work as a method for solving NP-complete problems. 
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tion of the state that is in the ground states can be made arbitrarily high, 
giving us a means of obtaining a very good approximation of a ground state 
from a random initial stated The primary difficulty with this method is that 
in general we do not actually have a means of applying [/(At) exactly — in 
fact, if we did then we most likely know or can easily obtain the eigende- 
composition and hence have no need for a variational approach in the first 
place. Fortunately, it turns out that there is an approximation known as the 
Suzuki- Trotter expansion^*! that allows one to systematically approximate U 
in terms of polynomials of H . The approximation can be taken to arbitrary 
order. For example, to first order in the size of the time step we have that 
U'(8t) — I — 8t ■ H + 0(5t 2 ). By combining many small time steps we have 
that U'(At) = U'(5t) At/5t « (I -5t- H) At / St , the total error to first order of 
which is proportional to At/5t ■ 5t 2 = At ■ 5t, which can be made arbitrarily 
small for any At. Thus, in practice the imaginary time evolution approach 
involves picking an order for the Suzuki- Trotter decomposition (higher order 
means more calculations and greater complexity per step but fewer steps), 
picking a 5t, and then repeatedly applying this approximation of U'(5t) until 
the number of states in the automaton grows unmanageably large, at which 
point a truncation operation is applied that attempts to find the best pos- 
sible approximation to the original automaton that uses fewer states. The 
process of alternating between applying a small time step and truncating the 
automaton to prune it to a manageable size is then continued until the state 
converges to a fixed point. 

There is another approach that uses sweeping (see Refs. [IT] and [5]). 
The basic idea behind this approach is that rather than applying a global 
transformation to the whole system until we converge to an answer we instead 
zoom in on a specific site and optimize it independently from the rest of the 
system. We do this by constructing an environment for the focused site that 
effectively takes the expected value of the hamiltonian for the infinite system 
and sums over all sites but the focused site. The end result is a matrix M 
such that (i/jj o M){ipf) = o H)(ip) is equal to the expected value of the 
hamiltonian for the state of the entire system ip e £Hat^(C, A) as a function 

9 Actually, if by some horrible accident we start with a state that has zero overlap with 
any ground state then this is not true, but this is a low probability event and furthermore 
it can be mitigated by trying several initial random states and keeping the lowest energy 
one. 

10 Trotter figured out a first order approximation in Ref. [24] . and Suzuki generalized 
this idea to generate approximations at all orders in Ref. [23]. 
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of the focused site ij) f G S Q * xQ ^ ((A)) . Because M is a small, bite-sized 
matrix, we can (relatively) easily solve for its lowest energy eigenstate^j] and 
substitute it for ipf, thus reducing the energy of the entire system. After 
doing this, we then absorb ipf to the left or the right by making a copy of it 
and expanding the respective left or right sum in the environment to include 
it. We then repeat this process until we have converged to a fixed point, and 
then we increase the number of states in the automaton and then repeat the 
whole process until a fixed point has been reached (or we run out of memory). 

Once a ground state has been found, the ability to compute expectation 
values means that one can perform many kinds of analyses on it. For example, 
we can compute the magnetization, and we can also compute a correlator, 
which is an operator of the form I~kZI k Z-kI that provides information about 
how likely a particle at some arbitrary site i is to agree with the particle at 
site i + k + 1 if both particles have their spin measured along the Z axis. 

6.4 Proof of Concept 

To illustrate an example of simulating a quantum system, we consider the 
Haldane-Shastry model (see Refs. [12] and [21]), which was simulated using 
the sweep method we just discussed in Ref. [5]. This model is interesting 
for two reasons: first, it is exactly solvable, so that we can see how well 
the obtained ground state emulates the properties of the true ground state, 
and second, it involves a hamiltonian with a sufficiently non-trivial structure 
that the model provides a non-trivial test for the approaches we have been 
discussing. The hamiltonian of the Haldane-Shastry model takes the form, 
H = J2t^-ooJ2T=i&i ■ v i+r /r 2 where a = (X,Y,Z) (where X, Y, and Z 
were defined earlier). This model physically represents a biinfinite chain of 
particles with spins that interact antiferromagnetically (that is, so that they 
don't want to line up) with each other along all directions and with a potential 
that decreases with the square of the distance. Now, this hamiltonian turns 
out to be one of the rare cases we mentioned which cannot be expressed 
exactly as a bidiverging power series due to the 1/r 2 coefficient. Fortunately 
it can be expressed arbitrarily well by using a sum of decaying exponentials, 
i.e. Yli a iPI f° r some a i an d 0i- In Ref. [5] we computed approximation 

11 Technically there should also be a similar matrix N obtained by summing over all other 
sites for the normalization and we should be solving the generalized eigenvalue problem 
Mv = XNv, but in practice it turns out that we can keep the system normalized in such 
a way that N is the identity. 
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Figure 4: This plot shows the energy residual (the difference between the 
exact and approximated energies) as a function of x (the number of states 
used in the automaton, which increased over time as the solver ran) for 
the simulations run using the 3-term, 6-term, and 9-term expansions of the 
hamiltonian. [Note: This figure was taken directly from Ref. [5] for the sake 
of illustration; it was originally created by the author of this paper.] 
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Figure 5: The top of this figure plots the expansions of 1/r 2 using 3, 6, and 
9 terms, against the exact value of 1/r 2 . The bottom of this figure plots the 
correlator (which can be thought of how likely it is that two spins will agree 
as a function of distance) for states with various values of \ (the number of 
states in the automata). In both cases, the curves below the main curves 
that are tagged with epsilons are the residuals (the differences between the 
approximate values and the exact values). [Note: This figure was taken 
directly from Ref. [5] for the sake of illustration; it was originally created by 
the author of this paper.] 



56 



using 3 terms, 6 terms, and 9 terms, and Figure (top) shows that the 
approximation works reasonably well in practice, as for 9 terms it produces 
an approximation that has an error less than about 10~ 6 for distances up to 
3000 siteS 

We applied the sweep approach discussed earlier to each of these ap- 
proximated hamiltonians; for each value of Xi which is what we denoted the 
number of states in the automaton, we computed the energy. The expected 
value of the energy turned out to be a linear functional and hence could 
be interpreted as an energy per site, which matches the exact solution of 
the model. The error in the energy per site of the approximate solution (as 
obtained by comparing it to the exact solution) for each of the three approx- 
imations of H and for each value of x is plotted in Figure HJ in particular 
we see that the solution obtained using the 9-term approximation of H had 
an energy residual of only about 3 x 10 -6 for x — 200. We also computed 
the correlator for the solution obtained using the the 9-term approximation 
and plotted it against the exact value of the correlator in Figure [5] (bottom); 
in particular we see that for x — 200 the correlators match to within about 
5 x 10 -5 out to 3000 sites. This example has demonstrated that the tech- 
niques that have been discussed throughout this section do work in practice, 
allowing us to obtain and analyze very good approximations to the ground 
states of biinfmite systems. 

7 Conclusions 

In this paper we have introduced a new kind of automaton called a diverging 
automaton which explicitly captures the divergences caused by uniting infi- 
nite words with weighted automata by modeling the divergence as a sequence 
of weights. We have presented a corresponding diverging power series as well 
as natural rational operations, and proven a Kleene Theorem that shows that 

12 To get a sense of why this number is usefully large, it is helpful to know that most 
models that are studied only consider interactions between nearest neighbors or possibly 
next-nearest neighbors, and 3000 3> 2. 

More precisely, we found that in the large n limit the expected value of the energy 
turns out to be a linear function, which was sufficient for our purpose of comparing it 
to the exact energy per site of the Haldane-Shastry model in the infinite size limit. We 
computed only the large n limit of the energy because computing the full function would 
have required computing the full Jordan Normal Form of the expected value's automaton's 
transition matrix, which would have been expensive. 
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the set of (prefix-free) rational diverging power series is equal to the set of be- 
haviors of diverging automata. We have furthermore presented extensions of 
these ideas to biinfinite words, resulting in biinfinite automata and biinfinite 
power series with, of course, another Kleene Theorem connecting the first to 
the rational subset of the second. Finally, we have demonstrated the useful- 
ness of these constructions by showing how rational bidiverging power series 
are very important in quantum simulation due to their ability to provide a 
powerful means of approximating the states of biinfinite quantum systems. 

There are at least two obvious directions for future research. First, it 
would be good to find a theory that generalizes and unites the theory we have 
just presented here with the theory of Conway *-semiring — ^-semimodule 
pairs, just as the latter provided a generalization that united weighted lan- 
guages with infinite languages for a subset of semirings. Second, because 
people tend to be interested in systems with more than a single dimension, 
it would be useful to extend the formalism presented in this paper to power 
series over pictures (see Ref. [16]) which, like bidiverging power series, also 
have useful applications in quantum simulation (see Ref. [33]). 

Finally, it is worth noting that we have demonstrated something very 
important here, which is that there is a significant link between automata 
theory and a family of techniques in quantum simulation. It is a hope of 
the authors that this link will benefit both fields of research by leading to 
cross-fertilization of ideas between them. 
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